智能论文笔记

Reconfigurable Intelligent Surface-assisted Classification of Modulations using Deep Learning

Mir Lodro , Hamidreza Taghvaee , Jean Baptiste Gros , Steve Greedy , Geofrroy Lerosey , Gabriele Gradoni

分类：人工智能

2022-09-17

无线网络的第五生成（5G）将更加自适应和异质。可重新配置的智能表面技术使5G能够在多仪波形上工作。但是，在这样的动态网络中，特定调制类型的识别至关重要。我们提出了基于人工智能的RIS辅助数字分类方法。我们培训卷积神经网络以对数字调制进行分类。所提出的方法可以直接在接收的信号上学习并学习特征，而无需提取功能。介绍和分析了卷积神经网络学到的功能。此外，还研究了在特定SNR范围内接收信号的强大功能。发现所提出的分类方法的准确性很显着，尤其是对于低水平的SNR。

translated by 谷歌翻译

Semantically Enhanced Global Reasoning for Semantic Segmentation

Mir Rayat Imtiaz Hossain , Leonid Sigal , James J. Little

分类：计算机视觉 | 机器学习

2022-12-06

Recent advances in pixel-level tasks (e.g., segmentation) illustrate the benefit of long-range interactions between aggregated region-based representations that can enhance local features. However, such pixel-to-region associations and the resulting representation, which often take the form of attention, cannot model the underlying semantic structure of the scene (e.g., individual objects and, by extension, their interactions). In this work, we take a step toward addressing this limitation. Specifically, we propose an architecture where we learn to project image features into latent region representations and perform global reasoning across them, using a transformer, to produce contextualized and scene-consistent representations that are then fused with original pixel-level features. Our design enables the latent regions to represent semantically meaningful concepts, by ensuring that activated regions are spatially disjoint and unions of such regions correspond to connected object segments. The resulting semantic global reasoning (SGR) is end-to-end trainable and can be combined with any semantic segmentation framework and backbone. Combining SGR with DeepLabV3 results in a semantic segmentation performance that is competitive to the state-of-the-art, while resulting in more semantically interpretable and diverse region representations, which we show can effectively transfer to detection and instance segmentation. Further, we propose a new metric that allows us to measure the semantics of representations at both the object class and instance level.

translated by 谷歌翻译

A Sequence Agnostic Multimodal Preprocessing for Clogged Blood Vessel Detection in Alzheimer's Diagnosis

Partho Ghosh , Md. Abrar Istiak , Mir Sayeed Mohammad , Swapnil Saha , Uday Kamal

分类：计算机视觉

2022-11-06

Successful identification of blood vessel blockage is a crucial step for Alzheimer's disease diagnosis. These blocks can be identified from the spatial and time-depth variable Two-Photon Excitation Microscopy (TPEF) images of the brain blood vessels using machine learning methods. In this study, we propose several preprocessing schemes to improve the performance of these methods. Our method includes 3D-point cloud data extraction from image modality and their feature-space fusion to leverage complementary information inherent in different modalities. We also enforce the learned representation to be sequence-order invariant by utilizing bi-direction dataflow. Experimental results on The Clog Loss dataset show that our proposed method consistently outperforms the state-of-the-art preprocessing methods in stalled and non-stalled vessel classification.

translated by 谷歌翻译

The Chamber Ensemble Generator: Limitless High-Quality MIR Data via Generative Modeling

Yusong Wu , Josh Gardner , Ethan Manilow , Ian Simon , Curtis Hawthorne , Jesse Engel

分类：机器学习

2022-09-28

数据是现代机器学习系统的命脉，包括音乐信息检索中的命脉（MIR）。但是，MIR长期以来一直被小型数据集和不可靠的标签所困扰。在这项工作中，我们建议使用生成建模打破这种瓶颈。通过使用室内合奏的结构化合成模型（在URMP上训练的MIDI-DDSP）的结构化合成模型，通过管道说明（在巴赫合唱上训练的椰子）模型，我们演示了一个能够生成无限量的逼真的合唱音乐的系统，其中包括丰富的结合音乐，包括混合，包括混合，，，包括混合，茎，MIDI，笔记级性能属性（Staccato，Vibrato等），甚至是细粒的合成参数（音高，振幅等）。我们称此系统为室内集合发生器（CEG），并使用它来生成来自四个不同腔室合奏（cocochorales）的大型合唱数据集。我们证明，使用我们的方法生成的数据改善了音乐转录和源分离的最新模型，并且我们均发布了系统和数据集作为MIR社区未来工作的开源基础。

translated by 谷歌翻译

Unsupervised Ensemble Based Deep Learning Approach for Attack Detection in IoT Network

Mir Shahnawaz Ahmed , Shahid Mehraj Shah

分类：机器学习

2022-07-16

物联网（物联网）通过通过互联网控制设备/事物来改变生活。物联网已为日常问题指定了许多智能解决方案，将网络物理系统（CPS）和其他经典领域转化为智能区域。构成物联网的大多数边缘设备具有极低的处理能力。为了降低物联网网络，攻击者可以利用这些设备进行各种网络攻击。此外，随着越来越多的物联网设备的添加，新的和未知威胁的潜力呈指数增长。因此，必须开发针对可以识别此类威胁的物联网网络的智能安全框架。在本文中，我们开发了一种无监督的集合学习模型，该模型能够从未标记的数据集中检测物联网中的新或未知攻击。系统生成的标记数据集用于训练深度学习模型以检测IoT网络攻击。此外，研究提出了一种特征选择机制，用于识别数据集中最相关的方面以检测攻击。该研究表明，建议的模型能够识别未标记的物联网网络数据集和DBN（深信念网络）的表现优于其他模型，检测准确性为97.5％，错误警报率为2.3％，当使用由标记的数据集进行培训时建议的方法。

translated by 谷歌翻译

Artificial Intelligence-Assisted Optimization and Multiphase Analysis of Polygon PEM Fuel Cells

Ali Jabbary , Nader Pourmahmoud , Mir Ali Asghar Abdollahi , Marc A. Rosen

分类：神经与进化计算 | 机器学习

2022-04-10

本文介绍了新的六角形和五角形PEM燃料电池模型。在实现了改善的细胞性能后，这些模型已得到了优化。多目标优化算法的输入参数是入口处的压力和温度，消耗和输出功率是客观参数。数值模拟的输出数据已使用深神经网络训练，然后以多项式回归进行建模。已使用RSM（响应表面方法）提取目标函数，并使用多目标遗传算法（NSGA-II）优化了目标。与基本模型相比，优化的五角大楼和六边形模型分别将输出电流密度增加21.8％和39.9％。

translated by 谷歌翻译

NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation

Kaustubh D. Dhole , Varun Gangal , Sebastian Gehrmann , Aadesh Gupta , Zhenhao Li , Saad Mahamood , Abinaya Mahendiran , Simon Mille , Ashish Srivastava , Samson Tan

分类：自然语言处理 | 人工智能 | 机器学习

2021-12-06

数据增强是自然语言处理（NLP）模型的鲁棒性评估的重要组成部分，以及增强他们培训的数据的多样性。在本文中，我们呈现NL-Cogmenter，这是一种新的参与式Python的自然语言增强框架，它支持创建两个转换（对数据的修改）和过滤器（根据特定功能的数据拆分）。我们描述了框架和初始的117个变换和23个过滤器，用于各种自然语言任务。我们通过使用其几个转换来分析流行自然语言模型的鲁棒性来证明NL-Upmenter的功效。基础架构，Datacards和稳健性分析结果在NL-Augmenter存储库上公开可用（\ url {https://github.com/gem-benchmark/nl-augmenter}）。

translated by 谷歌翻译

Traffic-Net: 3D Traffic Monitoring Using a Single Camera

Mahdi Rezaei , Mohsen Azarmi , Farzam Mohammad Pour Mir

分类：计算机视觉 | 人工智能 | 机器学习

2021-09-19

计算机视觉在智能运输系统（ITS）和交通监视中发挥了重要作用。除了快速增长的自动化车辆和拥挤的城市外，通过实施深层神经网络的实施，可以使用视频监视基础架构进行自动和高级交通管理系统（ATM）。在这项研究中，我们为实时交通监控提供了一个实用的平台，包括3D车辆/行人检测，速度检测，轨迹估算，拥塞检测以及监视车辆和行人的相互作用，都使用单个CCTV交通摄像头。我们适应了定制的Yolov5深神经网络模型，用于车辆/行人检测和增强的排序跟踪算法。还开发了基于混合卫星的基于混合卫星的逆透视图（SG-IPM）方法，用于摄像机自动校准，从而导致准确的3D对象检测和可视化。我们还根据短期和长期的时间视频数据流开发了层次结构的交通建模解决方案，以了解脆弱道路使用者的交通流量，瓶颈和危险景点。关于现实世界情景和与最先进的比较的几项实验是使用各种交通监控数据集进行的，包括从高速公路，交叉路口和城市地区收集的MIO-TCD，UA-DETRAC和GRAM-RTM，在不同的照明和城市地区天气状况。

translated by 谷歌翻译

MS MARCO: A Human Generated MAchine Reading COmprehension Dataset

Payal Bajaj , Daniel Campos , Nick Craswell , Li Deng , Jianfeng Gao , Xiaodong Liu , Rangan Majumder , Andrew McNamara , Bhaskar Mitra , Tri Nguyen

分类：

2016-11-28

We introduce a large scale MAchine Reading COmprehension dataset, which we name MS MARCO. The dataset comprises of 1,010,916 anonymized questionssampled from Bing's search query logs-each with a human generated answer and 182,669 completely human rewritten generated answers. In addition, the dataset contains 8,841,823 passages-extracted from 3,563,535 web documents retrieved by Bing-that provide the information necessary for curating the natural language answers. A question in the MS MARCO dataset may have multiple answers or no answers at all. Using this dataset, we propose three different tasks with varying levels of difficulty: (i) predict if a question is answerable given a set of context passages, and extract and synthesize the answer as a human would (ii) generate a well-formed answer (if possible) based on the context passages that can be understood with the question and passage context, and finally (iii) rank a set of retrieved passages given a question. The size of the dataset and the fact that the questions are derived from real user search queries distinguishes MS MARCO from other well-known publicly available datasets for machine reading comprehension and question-answering. We believe that the scale and the real-world nature of this dataset makes it attractive for benchmarking machine reading comprehension and question-answering models.

translated by 谷歌翻译